Distributionally Robust Optimization for Sequential Decision Making
نویسندگان
چکیده
The distributionally robust Markov Decision Process approach has been proposed in the literature, where the goal is to seek a distributionally robust policy that achieves the maximal expected total reward under the most adversarial joint distribution of uncertain parameters. In this paper, we study distributionally robust MDP where ambiguity sets for uncertain parameters are of a format that can easily incorporate in its description the uncertainty’s statistical information estimated from historical data. In this way, we generalize existing works on distributionally robust Markov Decision Process with generalized-moment-based ambiguity sets and statistical-distance-based ambiguity sets to incorporate information from the former class such as moments and dispersions to the latter class that critically depend on samples. We show that, under this format of ambiguity sets, the resulting distributionally robust Markov Decision Process remains tractable under mild technical conditions. To be more specific, a distributionally robust policy can be constructed by solving a collection of one-stage convex optimization subproblems.
منابع مشابه
A Practically Efficient Approach for Solving Adaptive Distributionally Robust Linear Optimization Problems
We develop a modular and tractable framework for solving an adaptive distributionally robust linear optimization problem, where we minimize the worst-case expected cost over an ambiguity set of probability distributions. The adaptive distrbutaionally robust optimization framework caters for dynamic decision making, where decisions can adapt to the uncertain outcomes as they unfold in stages. Fo...
متن کاملDistributionally Robust Convex Optimization
Distributionally robust optimization is a paradigm for decision-making under uncertaintywhere the uncertain problem data is governed by a probability distribution that is itself subjectto uncertainty. The distribution is then assumed to belong to an ambiguity set comprising alldistributions that are compatible with the decision maker’s prior information. In this paper,we propose...
متن کاملDistributionally robust chance constraints for non-linear uncertainties
This paper investigates the computational aspects of distributionally robust chance constrained optimization problems. In contrast to previous research that mainly focused on the linear case (with a few exceptions discussed in detail below), we consider the case where the constraints can be non-linear to the decision variable, and in particular to the uncertain parameters. This formulation is o...
متن کاملDistributionally Adaptive Optimization
We develop a modular and tractable framework for solving a distributionally adaptive optimization problem, where we minimize the worst-case expected cost over an ambiguity set of probability distributions. The adaptive optimization framework caters for dynamic decision making, where decisions can adapt to the uncertain outcomes as they unfold in stages. We propose a second-order conic (SOC) rep...
متن کاملTractable Distributionally Robust Optimization with Data
We present a unified and tractable framework for distributionally robust optimization that could encompass a variety of statistical information including, among others things, constraints on expectation, conditional expectation, and disjoint confidence sets with uncertain probabilities defined by φ-divergence. In particular, we also show that the Wasserstein-based ambiguity set has an equivalen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1801.04745 شماره
صفحات -
تاریخ انتشار 2018